Document Structure
نویسندگان
چکیده
We argue the case for abstract document structure as a separate descriptive level in the analysis and generation of written texts. The purpose of this representation is to mediate between the message of a text (i.e., its discourse structure) and its physical presentation (i.e., its organization into graphical constituents like sections, paragraphs, sentences, bulleted lists, figures, and footnotes). Abstract document structure can be seen as an extension of Nunberg’s “text-grammar”; it is also closely related to “logical” markup in languages like HTML and LaTEX. We show that by using this intermediate representation, several subtasks in language generation and language understanding can be defined more cleanly.
منابع مشابه
Persian Printed Document Analysis and Page Segmentation
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifyi...
متن کاملA policy framework for the challenges of implementing regional higher education management in Iran
The models of regional governance in the world, particularly for administration of higher education are considered vital. In Iran, with the approval of Iran's Higher Education System Spatial Management Document, the issue of regional management in higher education was given special attention. Articles 1 and 2 of the document specifically address the regional higher education structure of the ...
متن کاملStrategies for promoting the Supervisory board Subject of Article 6 of the Registration Law Emphasizing the Transformation Document of the Judiciary
Abstract The Supervisory Board (Article 6 of the Law on the Registration of Deeds and Property) is the authority to deal with disputes and errors regarding the registration of documents and property. This reference lacks a procedure. The current method of handling this reference is incomplete and contrary to the policy of reducing the work of the court. If we want to make minor reforms in the ...
متن کاملAutomatic Workflow Generation and Modification by Enterprise Ontologies and Documents
This article presents a novel method and development paradigm that proposes a general template for an enterprise information structure and allows for the automatic generation and modification of enterprise workflows. This dynamically integrated workflow development approach utilises a conceptual ontology of domain processes and tasks, enterprise charts, and enterprise entities. It also suggests...
متن کاملAutomatic Workflow Generation and Modification by Enterprise Ontologies and Documents
This article presents a novel method and development paradigm that proposes a general template for an enterprise information structure and allows for the automatic generation and modification of enterprise workflows. This dynamically integrated workflow development approach utilises a conceptual ontology of domain processes and tasks, enterprise charts, and enterprise entities. It also suggests...
متن کاملبررسی استانداردهای ساختار، محتوا و واژهنامه پرونده الکترونیک سلامت در سازمانهای منتخب و ارائه الگوی مناسب برای ایران
Introduction: Electronic health record (EHR) is defined as digitally stored healthcare information about an individual's life time with the purpose of supporting continuity of care, education, and research. Major issue that needs to be addressed in order to accomplish with sharing and exchange is the development and use of content and structure standards in the EHR. Based on, this investigation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Linguistics
دوره 29 شماره
صفحات -
تاریخ انتشار 2003